NSF PAR Search | NSF Public Access Repository

The Enigma of Transcriptional Activation Domains

https://doi.org/10.1016/j.jmb.2024.168766

Erkine, Alexandre M; Oliveira, Marcos A; Class, Caleb A (November 2024, Journal of Molecular Biology)

Activation domains (ADs) of eukaryotic gene activators have remained enigmatic for decades: they are short, extremely variable sequences that are often intrinsically disordered in structure and interact with an uncertain number of targets. This general absence of specificity increasingly complicates the application of the widely accepted mechanism of AD function by recruitment of coactivators. The long-standing enigma at the heart of molecular biology demands a fundamental rethinking of established concepts. Here, we review the experimental evidence supporting a novel mechanistic model of gene activation, based on ADs functioning via surfactant-like, near-stochastic interactions with gene promoter nucleosomes. This new model is consistent with recent information-rich experimental data obtained using high-throughput synthetic biology and bioinformatics analysis methods, including machine learning. We clarify why the conventional biochemical principle of specificity for sequence, structures, and interactions fails to explain activation domain function. This perspective provides connections to the liquid-liquid phase separation model, identifies near-stochastic interactions as fundamental to biochemical function, and can be generalized to other cellular functions.
Free, publicly-accessible full text available November 1, 2025
Grammar rules and exceptions for the language of transcriptional activation domains

Cooper, David G; Erkina, Tamara Y; Broyles, Bradley K; Class, Caleb A; Erkine, Alexandre M (November 2024, iScience)

Free, publicly-accessible full text available November 1, 2025
Predicting transcriptional activation domain function using Graph Neural Networks

https://doi.org/10.1101/2024.05.08.593266

Farheen, Farhanaz; Broyles, Bradley K; Zhang, Yuanyuan; Ibtehaz, Nabil; Erkine, Alexandre M; Kihara, Daisuke (May 2024, bioRxiv)

Analysis of the factors that lead to the functionality of transcriptional activation domains remains a crucial yet challenging task, owing to the significant diversity in their sequences and their intrinsically disordered nature. Almost all existing methods for predicting activation domains have involved either traditional machine learning approaches, such as logistic regression, that are unable to capture complex patterns in data, or plain convolutional neural networks, and have been limited in their exploration of structural features. However, there is tremendous potential in inspecting the structural properties of activation domains, and an opportunity to investigate complex relationships between features of residues in the sequence. To address these gaps, we have utilized graph neural networks (GNNs), which can represent structural data in the form of nodes and edges, allowing nodes to exchange information among themselves. We have experimented with two kinds of graph formulations, one using residues as nodes and the other using atoms as nodes. A logistic regression model was also developed to analyze feature importance. For all the models, several feature combinations were tested. The residue-level GNN model with the feature combination of amino acid type, residue position, acidic/basic/aromatic property, and secondary structure gave the best performance, with accuracy, F1 score, and AUROC of 97.9%, 71%, and 97.1%, respectively, outperforming other existing methods in the literature when applied to the dataset we used. Among the other structure-based features that were analyzed, the amphipathic property of helices also proved to be an important feature for classification. Logistic regression results showed that the most dominant feature that makes a sequence functional is the frequency of different types of amino acids in the sequence. Our results have consistently shown that functional sequences have more acidic and aromatic residues, whereas basic residues are more common in non-functional sequences.
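The composition finding in the abstract above can be illustrated with a minimal sketch that computes the fraction of acidic, basic, and aromatic residues in a sequence. This is not the authors' code or feature pipeline. The function name `composition_features`, the residue groupings (in particular, counting histidine as basic), and the example sequence are all assumptions made for illustration.

```python
from collections import Counter

# Residue groupings are an assumption of this sketch, using one-letter codes.
# Whether histidine (H) counts as basic varies between analyses.
ACIDIC = set("DE")
BASIC = set("KRH")
AROMATIC = set("FWY")


def composition_features(sequence: str) -> dict[str, float]:
    """Return the fraction of acidic, basic, and aromatic residues."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must be non-empty")
    counts = Counter(seq)
    length = len(seq)

    def fraction(group: set[str]) -> float:
        # Counter returns 0 for residues that do not occur in the sequence.
        return sum(counts[aa] for aa in group) / length

    return {
        "acidic": fraction(ACIDIC),
        "basic": fraction(BASIC),
        "aromatic": fraction(AROMATIC),
    }


# Hypothetical 10-residue sequence, not taken from any dataset.
features = composition_features("DDEEFWLLKR")
print(features)
```

Fractions of this kind could serve as inputs to a simple classifier such as logistic regression. The abstract does not specify the exact feature encoding the authors used.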
Full Text Available
Activation of gene expression by detergent-like protein domains

https://doi.org/10.1016/j.isci.2021.103017

Broyles, Bradley K.; Gutierrez, Andrew T.; Maris, Theodore P.; Coil, Daniel A.; Wagner, Thomas M.; Wang, Xiao; Kihara, Daisuke; Class, Caleb A.; Erkine, Alexandre M. (September 2021, iScience)

Full Text Available
‘Nonlinear’ Biochemistry of Nucleosome Detergents

https://doi.org/10.1016/j.tibs.2018.09.006

Erkine, Alexandre M. (December 2018, Trends in Biochemical Sciences)

Full Text Available